AITopics | mean field algorithm

A Mean Field Algorithm for Bayes Learning in Large Feed-forward Neural Networks

Neural Information Processing Systems, Apr-6-2023, 18:03:35 GMT

We present an algorithm which is expected to realise Bayes optimal predictions in large feed-forward networks. It is based on mean field methods developed within statistical mechanics of disordered systems. We give a derivation for the single layer perceptron and show that the algorithm also provides a leave-one-out cross-validation test of the predictions.

bayes learning, feed-forward neural network, mean field algorithm, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (1.00)

Efficient Bayesian Inference of Sigmoidal Gaussian Cox Processes

Donner, Christian, Opper, Manfred

arXiv.org Machine Learning, Aug-2-2018

We present an approximate Bayesian inference approach for estimating the intensity of an inhomogeneous Poisson process, where the intensity function is modelled using a Gaussian process (GP) prior via a sigmoid link function. Augmenting the model using a latent marked Poisson process and Pólya-Gamma random variables, we obtain a representation of the likelihood which is conjugate to the GP prior. We approximate the posterior using a free-form mean field approximation together with the framework of sparse GPs. Furthermore, as an alternative approximation we suggest a sparse Laplace approximation of the posterior, for which an efficient expectation-maximisation algorithm is derived to find the posterior's mode. Results of both algorithms compare well with exact inference obtained by a Markov Chain Monte Carlo sampler and a standard variational Gauss approach, while being one order of magnitude faster.

artificial intelligence, machine learning, poisson process, (15 more...)

arXiv.org Machine Learning

1808.00831

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report (0.64)

Mean-Field Networks

Li, Yujia, Zemel, Richard

arXiv.org Machine Learning, Oct-21-2014

The mean field algorithm is a widely used approximate inference algorithm for graphical models whose exact inference is intractable. In each iteration of mean field, the approximate marginals for each variable are updated by getting information from the neighbors. This process can be equivalently converted into a feedforward network, with each layer representing one iteration of mean field and with tied weights on all layers. This conversion enables a few natural extensions, e.g. untying the weights in the network. In this paper, we study these mean field networks (MFNs), and use them as inference tools as well as discriminative models. Preliminary experiment results show that MFNs can learn to do inference very efficiently and perform significantly better than mean field as discriminative models.
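The conversion described in this abstract can be illustrated with a minimal sketch. It assumes a binary pairwise model p(x) ∝ exp(½ xᵀWx + bᵀx) with x ∈ {0,1}ⁿ and parallel (synchronous) updates; the model, the update schedule, and all names (`mean_field_layer`, `mean_field_network`, `W`, `b`) are illustrative assumptions, not taken from the paper. Each layer applies one mean field update, and passing the same `(W, b)` to every layer corresponds to the tied-weight case.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def mean_field_layer(q, W, b):
    """One parallel mean field update for a binary pairwise model (assumed form).

    q[i] approximates P(x_i = 1); W is symmetric with zero diagonal.
    """
    return sigmoid(W @ q + b)


def mean_field_network(q0, layers):
    """Feed-forward pass: each (W, b) pair in `layers` is one mean field iteration."""
    q = q0
    for W, b in layers:
        q = mean_field_layer(q, W, b)
    return q


# Hypothetical toy model: 5 binary variables with random symmetric couplings.
rng = np.random.default_rng(0)
n_vars, n_iters = 5, 20
A = rng.normal(scale=0.3, size=(n_vars, n_vars))
W = (A + A.T) / 2.0
np.fill_diagonal(W, 0.0)
b = rng.normal(scale=0.5, size=n_vars)
q0 = np.full(n_vars, 0.5)  # uninformative initial marginals

# Tied weights: the same (W, b) in every layer equals n_iters mean field iterations.
q_tied = mean_field_network(q0, [(W, b)] * n_iters)

# Reference: the plain iterative loop.
q_loop = q0.copy()
for _ in range(n_iters):
    q_loop = sigmoid(W @ q_loop + b)

# Untied weights: each layer owns its parameters, here initialised from the tied
# values; training them separately is not shown in this sketch.
untied_layers = [(W.copy(), b.copy()) for _ in range(n_iters)]
q_untied = mean_field_network(q0, untied_layers)

print(np.allclose(q_tied, q_loop))
```

With untied layers, the per-layer parameters could be fitted to data by gradient descent, which is the kind of extension the abstract mentions; this sketch stops at the forward pass.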

artificial intelligence, inference, machine learning, (17 more...)

arXiv.org Machine Learning

1410.5884

Country: North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Computing with Finite and Infinite Networks

Winther, Ole

Neural Information Processing Systems, Dec-31-2001

Using statistical mechanics results, I calculate learning curves (average generalization error) for Gaussian processes (GPs) and Bayesian neural networks (NNs) used for regression. Applying the results to learning a teacher defined by a two-layer network, I can directly compare GP and Bayesian NN learning.

algorithm, bayes optimal scenario, gaussian process, (13 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Skåne County > Lund (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.51)

Computing with Finite and Infinite Networks

Winther, Ole

Neural Information Processing Systems, Dec-31-2001

Using statistical mechanics results, I calculate learning curves (average generalization error) for Gaussian processes (GPs) and Bayesian neural networks (NNs) used for regression. Applying the results to learning a teacher defined by a two-layer network, I can directly compare GP and Bayesian NN learning.

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: Europe > Sweden (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.51)

A Mean Field Algorithm for Bayes Learning in Large Feed-forward Neural Networks

Opper, Manfred, Winther, Ole

Neural Information Processing Systems, Dec-31-1997

In the Bayes approach to statistical inference [Berger, 1985] one assumes that the prior uncertainty about parameters of an unknown data generating mechanism can be encoded in a probability distribution, the so-called prior. Using the prior and the likelihood of the data given the parameters, the posterior distribution of the parameters can be derived from Bayes' rule. From this posterior, various estimates for functions of the parameter, like predictions about unseen data, can be calculated. However, in general, those predictions cannot be realised by specific parameter values, but only by an ensemble average over parameters according to the posterior probability. Hence, exact implementations of Bayes' method for neural networks require averages over network parameters which in general can be performed by time-consuming Monte Carlo procedures.
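The ensemble average described above can be sketched with a small Monte Carlo example. This is not the mean field algorithm of the paper; it is a brute-force illustration under assumed choices: a single-layer perceptron, a standard Gaussian prior on the weights, a logistic likelihood with a hypothetical sharpness `beta`, and self-normalised importance sampling from the prior. The Bayes prediction is the sign of the posterior-weighted average of the outputs sign(w·x), which need not coincide with the output of any single weight vector, such as the posterior mean.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: labels generated by a random "teacher" perceptron.
dim, n_train = 3, 8
w_teacher = rng.normal(size=dim)
X = rng.normal(size=(n_train, dim))
y = np.sign(X @ w_teacher)

# Draw candidate weight vectors from the assumed Gaussian prior.
n_samples = 20000
W = rng.normal(size=(n_samples, dim))

# Assumed logistic likelihood p(y | w, x) = sigmoid(beta * y * w.x).
# log sigmoid(z) is computed stably as -logaddexp(0, -z).
beta = 5.0
margins = beta * (W @ X.T) * y            # shape (n_samples, n_train)
log_lik = -np.logaddexp(0.0, -margins).sum(axis=1)

# Self-normalised importance weights approximate the posterior over weights.
weights = np.exp(log_lik - log_lik.max())
weights /= weights.sum()

# Bayes prediction for a new input: posterior average over the ensemble.
x_new = rng.normal(size=dim)
avg_output = weights @ np.sign(W @ x_new)
bayes_prediction = np.sign(avg_output)

# For comparison: prediction of one specific parameter value (the posterior mean).
w_mean = weights @ W
plug_in_prediction = np.sign(w_mean @ x_new)

print(avg_output, bayes_prediction, plug_in_prediction)
```

The cost of this approach grows quickly with the number of weights, since prior samples rarely land in regions of high posterior probability; that is the expense the passage attributes to Monte Carlo procedures.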

approximation, equation, neural network, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

A Mean Field Algorithm for Bayes Learning in Large Feed-forward Neural Networks

Opper, Manfred, Winther, Ole

Neural Information Processing Systems, Dec-31-1997

In the Bayes approach to statistical inference [Berger, 1985] one assumes that the prior uncertainty about parameters of an unknown data generating mechanism can be encoded in a probability distribution, the so-called prior. Using the prior and the likelihood of the data given the parameters, the posterior distribution of the parameters can be derived from Bayes' rule. From this posterior, various estimates for functions of the parameter, like predictions about unseen data, can be calculated. However, in general, those predictions cannot be realised by specific parameter values, but only by an ensemble average over parameters according to the posterior probability. Hence, exact implementations of Bayes' method for neural networks require averages over network parameters which in general can be performed by time-consuming Monte Carlo procedures.

approximation, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country: